DBD: a transcription factor prediction database

نویسندگان

  • Sarah K. Kummerfeld
  • Sarah A. Teichmann
چکیده

Regulation of gene expression influences almost all biological processes in an organism; sequence-specific DNA-binding transcription factors are critical to this control. For most genomes, the repertoire of transcription factors is only partially known. Hitherto transcription factor identification has been largely based on genome annotation pipelines that use pairwise sequence comparisons, which detect only those factors similar to known genes, or on functional classification schemes that amalgamate many types of proteins into the category of 'transcription factor'. Using a novel transcription factor identification method, the DBD transcription factor database fills this void, providing genome-wide transcription factor predictions for organisms from across the tree of life. The prediction method behind DBD identifies sequence-specific DNA-binding transcription factors through homology using profile hidden Markov models (HMMs) of domains. Thus, it is limited to factors that are homologus to those HMMs. The collection of HMMs is taken from two existing databases (Pfam and SUPERFAMILY), and is limited to models that exclusively detect transcription factors that specifically recognize DNA sequences. It does not include basal transcription factors or chromatin-associated proteins, for instance. Based on comparison with experimentally verified annotation, the prediction procedure is between 95% and 99% accurate. Between one quarter and one-half of our genome-wide predicted transcription factors represent previously uncharacterized proteins. The DBD (www.transcriptionfactor.org) consists of predicted transcription factor repertoires for 150 completely sequenced genomes, their domain assignments and the hand curated list of DNA-binding domain HMMs. Users can browse, search or download the predictions by genome, domain family or sequence identifier, view families of transcription factors based on domain architecture and receive predictions for a protein sequence.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

DBD––taxonomically broad transcription factor predictions: new content and functionality

DNA-binding domain (DBD) is a database of predicted sequence-specific DNA-binding transcription factors (TFs) for all publicly available proteomes. The proteomes have increased from 150 in the initial version of DBD to over 700 in the current version. All predicted TFs must contain a significant match to a hidden Markov model representing a sequence-specific DNA-binding domain family. Access to...

متن کامل

Mutational analysis of AREA, a transcriptional activator mediating nitrogen metabolite repression in Aspergillus nidulans and a member of the "streetwise" GATA family of transcription factors.

The transcriptional activator AREA is a member of the GATA family of transcription factors and mediates nitrogen metabolite repression in the fungus Aspergillus nidulans. The nutritional versatility of A. nidulans and its amenability to classical and reverse genetic manipulations make the AREA DNA binding domain (DBD) a useful model for analyzing GATA family DBDs, particularly as structures of ...

متن کامل

Dissecting the function of the adult β-globin downstream promoter region using an artificial zinc finger DNA-binding domain

Developmental stage-specific expression of the β-type globin genes is regulated by many cis- and trans-acting components. The adult β-globin gene contains an E-box located 60 bp downstream of the transcription start site that has been shown to bind transcription factor upstream stimulatory factor (USF) and to contribute to efficient in vitro transcription. We expressed an artificial zinc finger...

متن کامل

Positive and negative regulation of the cardiovascular transcription factor KLF5 by p300 and the oncogenic regulator SET through interaction and acetylation on the DNA-binding domain.

Here we show a novel pathway of transcriptional regulation of a DNA-binding transcription factor by coupled interaction and modification (e.g., acetylation) through the DNA-binding domain (DBD). The oncogenic regulator SET was isolated by affinity purification of factors interacting with the DBD of the cardiovascular transcription factor KLF5. SET negatively regulated KLF5 DNA binding, transact...

متن کامل

Lineage-specific expansion of DNA-binding transcription factor families

DNA-binding domains (DBDs) are essential components of sequence-specific transcription factors (TFs). We have investigated the distribution of all known DBDs in more than 500 completely sequenced genomes from the three major superkingdoms (Bacteria, Archaea and Eukaryota) and documented conserved and specific DBD occurrence in diverse taxonomic lineages. By combining DBD occurrence in different...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Nucleic Acids Research

دوره 34  شماره 

صفحات  -

تاریخ انتشار 2006